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ABSTRACT 

We describe an attempt to reconstruct the initial conditions for the formation of 
cosmological large-scale structure, under the assumption of gravitational instability 
' in a Gaussian density field. Information on the power spectrum of the primordial 

, fluctuations is provided by a variety of autocorrelation and cross-correlation analyses 

■ on samples of different classes of galaxy and galaxy clusters. These results differ from 

the desired linear power spectrum because of three modifying effects: bias, nonlinear 

y—( , evolution and redshift-space distortions. We show how the latter two effects can be 

^ • corrected for analytically, allowing the linear mass spectrum to be recovered provided 

t***- \ the bias is independent of scale for a given class of galaxy. We argue that this is a 

I/"") . good assumption for large scales, which is well verified in practice. 

We apply this method to eight independent datasets, and obtain excellent 
agreement in the estimated linear power spectra for wavelengths A > 10 ft. _1 Mpc, 
given the following conditions. First, the relative bias factors for Abell clusters, radio 
galaxies, optical galaxies and IRAS galaxies must be in the ratios b A : b R : b : b T = 
4.5 : 1.9 : 1.3 : 1, to within 6 per cent rms. Second, the data require a significant 

6 ■ degree of redshift-space distortion: f2 - 6 /6 : = 1.0 ± 0.2. Third, low values of ft and 

bias are disfavoured because nonlinear evolution would spoil the agreement in shape 
q . between galaxy and cluster power spectra. The amplitude of the preferred linear power 

!_h ' spectrum is only weakly dependent on Q and agrees well at large wavelengths with the 

\ normalization demanded by the COBE data for a scale-invariant primordial spectrum, 

C$ • provided Q = 1 and gravity-wave anisotropics are negligible. In this case, the shape of 

the spectrum is extremely well described by a CDM transfer function with an apparent 
. ,-h , value of the fitting parameter Qh = 0.25. Tilted models, for which inflation requires 

■ a large gravity-wave contribution to the COBE data, predict too little power at 100 
?—i ' Mpc wavelengths. 
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1 INTRODUCTION 

The simplest hypothesis for the origin of the large-scale 
structure of the universe is that it is the result of the opera- 
tion of gravitational instability on small initial density per- 
turbations. On grounds of economy, these are often assumed 
to have the random-phase character common in noise pro- 
cesses, and hence to form a Gaussian random field. This has 
been the standard picture for structure formation for the half 
century since the pioneering studies of Lifshitz, given added 
motivation more recently by inflationary theories in which 
the initial perturbations are supplied by quantum fluctua- 
tions at early times. 

If this picture is correct, the only quantity needed for a 
complete statistical description of the cosmological density 



field is the power spectrum of the fluctuations at some early 
time (or its linear-theory extrapolation to the present). Ob- 
servationally, much progress has been made in recent years 
towards the goal of determining the power spectrum, fulfill- 
ing the programme outlined by Peebles (1973). New gener- 
ations of deep redshift surveys have allowed the clustering 
of various classes of galaxy to be determined up to the con- 
tribution from wavelengths of several hundred Mpc. In par- 
allel, new analysis techniques have been developed in order 
to extract the long-wavelength portion of the power spec- 
trum more sensitively (e.g. Feldman, Kaiser & Peacock 1993 
[FKP] and refs therein). 

The intention of this paper is to compare various re- 
cent determinations of galaxy clustering, and to see if there 
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exists a single consistent picture for the underlying mass 
fluctuations. It is an updated version of a previous attempt 
in this direction (Peacock 1991), but with several impor- 
tant improvements in addition to a great increase in data. 
In essence, there are three filters that cause the observed 
clustering properties of galaxies to depart from the desired 
linear mass power spectrum: 

(i) Nonlinear evolution. On small scales, perturbation the- 
ory fails and the mass power spectrum departs in a com- 
plicated way from a linear extrapolation of the initial 
conditions. 

(ii) Redshift-space effects. Because 3D datasets use redshift 
as a radial coordinate, the apparent density field that 
results is distorted through the existence of peculiar ve- 
locities. Even for perfect data, the redshift-space power 
spectrum is not the same as that in real space. 

(iii) Bias. The fact that different species of galaxy follow the 
mass distribution with different degrees of fidelity is a 
major problem in relating observations to theory. To 
correct for bias in principle requires a detailed model 
for how the effect arises. 

None of these effects have been handled very thoroughly 
in previous work. The issue of bias is the most difficult, and is 
really only tractable on large scales where the degree of bias 
can be assumed constant. It is now possible to have a better 
idea of where this approximation is valid, and we discuss 
this issue in Section 2. Previously, nonlinear distortions were 
either ignored or treated by comparing nonlinear data with 
an evolved JV-body model. However, thanks to the insight 
of Hamilton et al. (1991; HKLM), it is possible to correct 
the data for the effects of nonlinearities. We discuss their 
method in Section 3 and give a number of generalizations. 
Redshift-space distortions have usually been treated by a 
simple scaling of amplitude analyzed by Kaiser (1987), but 
this is inapplicable on small scales. We give an improved 
analysis in Section 4. 

Given a method for treating the practical distortions 
of power spectra, there are two possible approaches. There 
is an honourable tradition which states that it is better to 
apply any corrections to the theory under test, and to com- 
pare the modified model with the raw data. Nevertheless, we 
shall do the opposite and estimate the linear spectrum by 
correcting the data. This has two advantages: no model is 
needed, and the power spectrum can be found empirically; 
by comparing the various estimates, we can then see directly 
if all datasets are consistent with each other. In Section 5, 
we assemble the most recent power-spectrum data and ap- 
ply the above tools to deduce the linear power spectrum. 
This empirical reconstruction is compared with a variety of 
a priori models in Section 6, and the main points of the 
paper are summarized in Section 7. 

2 GALAXY AND CLUSTER CORRELATIONS 
IN GAUSSIAN MODELS 

2.1 Evidence for Gaussian fluctuations 

Since a good part of the analysis in this paper rests on the 
assumption of a Gaussian density field, we should start by 
considering the evidence that this is a good approximation. 

The evidence has to be gathered on large scales, be- 
cause nonlinear evolution inevitably induced non-Gaussian 



statistics on small scales, whatever the initial statistics. The 
most direct test was carried out by FKP, who looked at the 
distribution of power measured for individual modes in a 
power-spectrum analysis of the IRAS QDOT redshift survey. 
For a Gaussian field, such modes should have power values 
that are independently exponentially distributed. This was 
found to be the case out the limit of the statistics - powers of 
about 10 times the mean. This is not a complete test of the 
Gaussian hypothesis: it is equivalent to asking in real space 
whether the one-point density distribution is Gaussian. Fur- 
ther information is provided by higher-order fc-space corre- 
lations which test for independence of the modes. Neverthe- 
less, it is worth recalling that there have been suggestions 
that even this lowest-order test is badly violated. On the ba- 
sis of a pencil-beam redshift survey, Broadhurst et al. (1990) 
and Szalay et al. (1991) have suggested that there is gross 
non-Gaussian behaviour on large scales, based on the exis- 
tence of strong quasi-periodic power at a few wavelengths. 
There is no need to repeat here the counter-arguments given 
by Kaiser & Peacock (1990); it should suffice to note that the 
QDOT sample is deep enough that it encompasses several 
of the suggested periods in a large number of independent 
directions, yet no non-Gaussian signature is detected. 

Any initial Gaussian nature of the field is completely 
erased on very small scales, but on intermediate scales the 
field develops a skewness which can be analyzed perturba- 
tively (Peebles 1980). The observed degree of skewness ap- 
pears to be in accord with this prediction (Gaztahaga 1992; 
Bouchet et al. 1993), which gives further support to the 
Gaussian hypothesis. This is not a definitive test, since most 
bias mechanisms will induce skewness; what is observed is 
a mixture of this effect with primordial skewness, plus the 
effects of gravitational evolution. Nevertheless, simple Gaus- 
sian models without a strong degree of bias do account for 
the data well. 

A variety of other tests have been suggested, includ- 
ing the topology of isodensity surfaces (Hamilton, Gott & 
Weinberg 1986; Coles & Plionis 1991; Moore et al. 1992) 
and the one-point distribution of the velocity field (Nusser 
& Dekel 1993; Kofman et al. 1993). It is fair to say that 
none of these methods has produced any evidence against 
primordial Gaussian statistics. However, as usual in statis- 
tics, it is necessary to choose the null hypothesis with care. 
It is certainly the case that not all tests are necessarily very 
powerful; the central limit theorem means that a variety 
of non-Gaussian processes may yield nearly Gaussian be- 
haviour in experiments where limited resolution averages 
over different regions of space (Scherrer 1992). Thus, some 
of the more popular models based on topological singulari- 
ties (strings, textures etc.) may still be allowed by existing 
data (e.g. Gooding et al. 1992). It will however be inter- 
esting to see such theories confronted with the FKP result, 
particularly as such statistics will become more demanding 
as datasets increase. 

For the present, it is enough to note that there is em- 
pirical reason to believe that the statistics of the large-scale 
density field are close to Gaussian. If this is so, then there 
are consequences for the clustering of galaxy systems, as dis- 
cussed below. As we will see, these predictions are verified in 
practice, which is one further piece of supporting evidence 
for the Gaussian picture. 
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2.2 Bias in galaxy and cluster correlations 

In a Gaussian model, the correlations of different classes 
of galaxy system can be directly related to the underlying 
density field, with the power spectra being proportional on 
large scales 

A 2 (k) = 6 2 ALss(fc). (1) 

Here and below, we shall use a dimensionless notation for 
the power spectrum designed to minimise uncertainties from 
differing Fourier conventions. In words, A 2 is the contribu- 
tion to the fractional density variance per bin of In fc; in the 
convention of Peebles (1980), this is 

A2 «^=(4 4 ^ 31412 - (2) 
The justification for the above relation is the assumption, in- 
troduced by Kaiser (1984), Peacock & Heavens (1985) and 
Bardeen et al. (1986; BBKS), that the sites of massive ob- 
jects such as clusters can be identified at early times as high 
peaks in the linear density field. Such a scheme might be 
termed Lagrangian bias, and b is called a bias parameter. 
This is rather sloppy: biased galaxy formation usually means 
the situation where light does not trace mass in the universe, 
but clusters would still be more correlated than the mass 
even if the galaxy distribution followed the mass exactly. 
However, this useage is too firmly embedded in the litera- 
ture to make it worth fighting; we will therefore describe the 
enhanced correlations of clusters as bias. 

As the density field evolves, the initial statistical clus- 
tering in Lagrangian space is supplemented as dynamics 
moves objects from their initial sites. Owing to the equiv- 
alence principle, all objects move in the same way, so that 
the overall observed clustering in Eulerian space is 

1 + feulcr = (1 + 6 Lagrange ) (1 + Adynamics)- (3) 

In the linear regime, we therefore have 

^Lagrange — (b - l)(5 d yn amies ■ (4) 

Bond & Couchman (1988) showed how this decomposition 
could be used to compute exact total correlations, under 
the assumption that the dynamical evolution obeyed the 
Zeldovich (1970) approximation. Mann, Heavens & Peacock 
(1993) applied this method to the calculation of cluster cor- 
relations. In practice, the statistical contribution tends to 
dominate for scales larger than the filter size used to define 
clusters (a few Mpc); the cluster distribution has not un- 
dergone strong dynamical evolution, and most clusters are 
close to their original sites. 

Although the idea of Lagrangian bias was borrowed by 
BBKS from its cluster origins and applied as a model for 
biased galaxy formation, it may be more fruitful to think 
of galaxy bias in a purely Eulerian way, where the density 
of galaxies is some function of the final mass density. This 
has long been advocated by Einasto and collaborators, with 
galaxy formation being suppressed in low-density regions 
(Einasto, Joeveer & Saar 1980). More recently, studies of 
the operation of dissipation in numerical simulations has 
produced a more direct physical justification for relating the 
galaxy and mass density fields through a single nonlinear 
function (Cen & Ostriker 1992). 

These contrasting views of the origin of cluster and 
galaxy bias lead to rather different approaches when at- 



tempting to use clustering data to infer the mass fluctua- 
tions. For clusters, the statistical bias is dominant, and we 
may assume that the clusters reflect mainly the initial con- 
ditions. Conversely, it is reasonable to believe that galaxies 
come close to tracing the mass. Many studies have indicated 
that different classes of galaxy follow the same 'skeleton' 
of voids, filaments, walls & clusters, while differing most 
markedly in regions of high density (e.g. Babul & Postman 
1990, Strauss et al. 1992). This last effect may not be so 
important: despite having densities differing by factors of 
close to 10 in rich clusters, we shall see below that IRAS 
and optical galaxies have bias factors within about 30 per 
cent of each other. This is analogous to the findings of Cen 
& Ostriker (1992): even though their model has a highly 
nonlinear dependence of galaxy density on mass density for 
high densities, the power spectra are proportional on most 
scales, even down to the point where A 2 (fc) ~ 1. In any case, 
it is important to keep in mind that we are not interested in 
exactly how a given class of galaxy does or does not follow 
the density field: a variety of different bias schemes could 
give the same galaxy power spectrum, even though the light 
distributions would be model dependent. 

The above discussion motivates the assumptions that 
we shall use below to make estimates of the linear power 
spectrum. We shall adopt the extreme approximations that 
the cluster distribution contains information only about 
the linear power spectrum, whereas the galaxy distribution 
mainly measures the nonlinear density field: 

A 2 = b% A 2 (5) 
A| = fe|A 2 L (6) 

A further way of understanding this distinction is to con- 
sider the following illustrative model: populate the universe 
with identical spherical protocluster perturbations. At some 
critical time, these will turn round and virialize, producing 
a large excess of small-scale power in the nonlinear den- 
sity field. However, at this time, the cluster centres will 
still be weakly perturbed: the existence of the small-scale 
power is what allows us to say that clusters are present, but 
there is no reason to expect this power to manifest itself in 
many close pairs of cluster centres. Ultimately, our hypoth- 
esis must submit to the test of numerical simulation, but for 
the present it should certainly be closer to the truth to say 
that clusters respond to the linear power spectrum, rather 
than to the nonlinear one. 

Although the above bias factors are calculable given a 
specific bias model, we shall treat them as unknowns to be 
determined from the data. It is clear that the assumption 
of constant bias factors cannot be exact, and will certainly 
break down at small scales. To some extent, the domain 
of validity can be found empirically, by seeing whether it is 
possible to make a consistent picture in this way from all the 
available data. In practice, we shall use data at wavenumbers 
k < 0.6 h Mpc -1 , i.e. wavelengths A > 10/i -1 Mpc (as usual, 
h = tfo/lOOkms^Mpc -1 ), so we are only dealing with the 
large-scale mass distribution. 

There is a third way in which the mass power spectrum 
may be inferred, which is to use cross-correlation data from 
two catalogues, in addition to the respective autocorrela- 
tions. In this case, it is not so obvious whether we measure 
more nearly the linear or nonlinear correlations. In practice, 
we shall use data on large enough scales that the distinction 
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will not be so important; we therefore assume a relation to 
linear theory 



&c&gA l 



(7) 



This provides a useful consistency test of our assumptions: 
the cluster-galaxy cross-correlation should be the geometri- 
cal mean of the separate auto-correlations. 



3 NONLINEAR EVOLUTION OF POWER 
SPECTRA 

To implement the above assumptions requires some way of 
relating linear and nonlinear power spectra; until recently, 
this would have required JV-body modelling. However, in a 
marvelous piece of alchemy, Hamilton et al. (1991; HKLM) 
gave a universal analytical formula for accomplishing the 
linear <-» nonlinear mapping. The conceptual basis of their 
method can be understood with reference to the spherical 
collapse model. For Q = 1 (the only case they considered), 
a spherical clump virializes at a density contrast of order 
100 when the linear contrast is of order unity. The trick 
now is to think about the density contrast in two distinct 
ways. To make a connection with the statistics of the density 
field, the correlation function £(r) may be taken as giving 
a typical clump profile. What matters for collapse is that 
the integrated overdensity reaches a critical value, so one 
should work with the volume-averaged correlation function 
£(r). A density contrast of 1 + 8 can also be thought of as 
arising through collapse by a factor (1 + S) 1 ^ 3 in radius, 
which suggests that a given non-linear correlation ( NL (r NL ) 
should be thought of as resulting from linear correlations on 
a linear scale 



r L = [1 + £NL(rNL)] 1/3 r N L. 



(8) 



This is one part of the HKLM procedure. The second part, 
having translated scales as above, is to conjecture that the 
nonlinear correlations are a universal function of the linear 
ones: 



Cnl^nl) = /nl[£l(j- l )]. 



(9) 



The asymptotics of the function can be deduced readily. For 
small arguments i< 1, / NL (i) ~ x; the spherical collapse 
argument suggests /nl(1) — 10 2 . Following collapse, £ NL 
depends on scale factor as a (stable clustering), whereas 
|l oc a 2 ; the l&rge-x limit is therefore / Nl (a;) oc x 3 ^ 2 . HKLM 
deduced from numerical experiments that the exact coeffi- 
cient was 



/ NL (a;) 11.68 x 3/2 



(10) 



and obtained a numerical fit that interpolated between these 
two regimes, in a manner that empirically showed negligible 
dependence on power spectrum. 

To use this method in the present application, we need 
two generalizations: we need to make the method work with 
power spectra, and we need the analogous results with / 
1. In principle, the translation between f(r) and A 2 (fc) is 
straightforward, but obtaining stable numerical results is not 
so easy. One route is to use the relations between £(r) and 

CM ' 



£(x) x dx 



(11) 



d[r :i qr)} 
d[r 3 ] ' 



(12) 



followed by the Fourier relations between £(r) and A 2 (fc) 



£(r)= / A 2 (fc). 



kr k 



2k 3 



A 2 (fc) = — I f(r) r 2 dr. 

ty /,. kr 



(13) 
(14) 



This approach is not so attractive. To obtain the nonlinear 
power spectrum from the linear one requires two numerical 
integrations, differentiation, followed by one further integra- 
tion. It is possible to do a little better by manipulating the 
above equations to relate A 2 (k) and £(r) directly 



£ (r) = / A 2 (fc) ^ „ [sinfcr — kr cosfcrl (15) 
Jo k (fer) 3 

A 2 (fc) = / f(r) r 2 dr .} , [sin kr — kr cos kr], (16) 
3tt J (kr) 

where the last relation holds provided that £(r) — ► faster 
than r -2 at large r (i.e. a spectrum which asymptotically 
has n > — 1, a valid assumption for spectra of practical 
interest). This looks better, since there are now only two in- 
tegrations required, and furthermore efficient methods exist 
for dealing with integrations with sin and cos weightings in 
the integrand. However, because the window function con- 
sists of the difference of two such terms, life is still not so 
easy: evaluating the two parts of the integral separately gives 
a result as a difference of two large numbers, which is thus 
generally of low accuracy. The most satisfactory practical 
procedure seems to be a mixture of the two possibilities: 
(i) evaluate a table of £l(t*) values for a given linear power 
spectrum by evaluating the oscillatory integral directly; (ii) 
transform to a table of £nl(»*) values using the HKLM pro- 
cedure; (iii) fit splines to the result and differentiate to get 
£nl(»"); (iv) Fourier transform to get A 2 1L (fc). The accuracy 
of the result can be improved in the final step by transform- 
ing £nl(j~) — which vanishes rapidly at large r, and 
then adding A 2 (k) to the answer. 

The above process is still rather time-consuming and in- 
elegant; it would be much better to make the HKLM method 
work directly in terms of power spectra, and this is usually 
possible. The main idea is that £(r) can often be thought 
of as measuring the power at some effective wavenumber: it 
is obtained as an integral of the product of A 2 (fe), which is 
often a rapidly rising function, and a window function which 
cuts off rapidly at high k. The answer can be approximated 
by replacing the exact window function by the Gaussian 
which is equivalent to second order in k: 



£(r) = A 2 (fc eff ) 



([n + l]/2)! 



l/(n + 3) 



i 

r 



(17) 
(18) 



where n is the effective power-law index of the power spec- 
trum. This approximation is within a few per cent of the 
exact integration provided n < 0. The effective wavenumber 
is insensitive to n, and is within 20 per cent of 2.4/r over the 
range — 2 < n < 0. In most circumstances, it is therefore an 
excellent approximation to use the HKLM formulae directly 
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to scale wavenumbers and powers: 



A NL (fe 



/ NL [A2(fe.)] 

1/3, 



[1 + A 2 L (fc NL )r 1 h 



(19) 
(20) 



Even better, it is not necessary that the number relating 
1/r and k c g be a constant over the whole spectrum. All 
that matters is that the number can be treated as constant 
over the limited range r NL to r L . This means that the devi- 
ations of the above formulae from the exact transformation 
of the HKLM procedure are only noticeable in cases where 
the power spectrum deviates markedly from a smooth mono- 
tonic function, or where either the linear or nonlinear spectra 
are very flat (n < —2). Even this is not obviously a problem, 
since the HKLM procedure itself is not exact and does not 
work so well for flat spectra, n < —2 (A.S. Hamilton, private 
communication) . Our approximation is illustrated in Figure 
1, which shows the result of nonlinear evolution on spec- 
tra with and without a short-wavelength cutoff. Whether 
the HKLM method actually applies to the first situation is 
an interesting question which we hope to investigate else- 
where. Other cases where our approximation would fail in- 
clude power spectra with the oscillations characteristic of 
pure baryon models. However, since the data studied here 
reveal no trace of such sharp features in the power spectrum, 
we may use the direct approximation for the nonlinear evo- 
lution of the power spectrum with confidence. 

It remains to generalise the result from the Q — 1 model 
considered by HKLM. This can be done partly analytically. 
The argument that leads to the / NL (x) oc x 3 ^ 2 asymptote in 
the nonlinear transformation is just that linear and nonlin- 
ear correlations behave as a 2 and a 3 respectively following 
collapse. If collapse occurs at high redshift, then Q — 1 may 
be assumed at that time, and the nonlinear correlations still 
obey the a 3 scaling to low redshift. All that has changed is 
that the linear growth is suppressed by some fi-dependent 
factor g(Q). It then follows that the large- a; asymptote of 
the nonlinear function is 



/ NL (z) -> 11.68 [g(n)]- 3 x 3/2 . 



(21) 



According to Carroll, Press & Turner (1992), the growth 
factor may be approximated almost exactly by 

g(Q) = ^fi m [Q 4 J 7 - O, + (1 + On/2)(l + fiv/70)] ~* - ( 22 ) 

where we have distinguished matter (m) and vacuum (v) 
contributions to the density parameter explicitly. We shall 
generally use f2 without a subscript to mean fl m hereafter. 

To interpolate between the expected nonlinear asymp- 
tote and the linear regime, numerical experiments are nec- 
essary. We therefore wrote a PM iV-body code, which was 
used to evolve a variety of initial spectra to a final state of 
given O m and fl v ; typically 64 3 particles and a 128 3 mesh 
were used. At a later stage of the investigation, we were able 
to check our results with the superior resolution provided by 
the AP 3 M code of Couchman (1991). It was also possible to 
compare with low-density CDM models published by Davis 
et al. (1985) and Kauffmann & White (1992). Our conclu- 
sion is that a near-universal behaviour analogous to that of 
HKLM does appear to exist for low-density models, at least 
for the linear spectra with power-law indices —2 < n < 
that we were able to test. We have produced the following 
fitting formula for the generalized / NL . This is designed to 



match the HKLM expression almost exactly in the £1 — 1 
limit, and to describe the main features of the alterations 
encountered in low-density models. The accuracy is approx- 
imately 10 per cent in terms of the deduced linear power A 2 
corresponding to a given A 2 ^, over the range 0.3 < g(Q) < 1. 



/nl(x) 



1 + 0.2f3x + (Ax) 



1/0 



(23) 



where the parameters are A = 0.84[g(fi)] ' 2 , a = 2/[g(Q)\, 
and (3 — 2g(Q). This fit says that the transition region be- 
tween the linear and nonlinear regimes is dominated by an 
/ml <x x 1+a power law, which becomes very steep for low- 
density models. This steepening has long been familiar from 
iV-body models, and the apparent power-law nature of the 
spectrum can been used as an argument against low-density 
models. We shall end up making a rather similar argument 
here. 

It is useful to have an analytical expression for the in- 
verse function, and the following agrees with the exact in- 
verse of our formula to within a typical maximum error of a 
few per cent over the range of interest: 



/nl(2/) = V 



l + (By^ 3 [ g 3 (Q)/ll.i 



2/3\6 1 1 / S 



1 + 0.2Sy + (By 



(24) 



where B = 0.96[g(Q.)] 07 , 7 = 1.03 - 0.39[g(fi)] 0B , and 
8 = 5[g(£l)] ' 3 . A plot of the O-dependent nonlinear function 
is shown in Figure 2. We show only models with zero vacuum 
energy, since the above reasoning shows that all that mat- 
ters is the linear growth-suppression factor g(0,). Note that, 
for spatially flat vacuum-dominated models, the growth sup- 
pression is rather more modest (roughly g(ft) = Q ' 2 ) than 
in models with zero vacuum energy (roughly g(0) = fi"' 7 ). 
Our results (and those of HKLM) apply only to initial con- 
ditions with Gaussian statistics. It is an interesting question 
to what extent the method will also apply to non-Gaussian 
models, an we hope to investigate this elsewhere. Some idea 
of the likely degree of universality may be gained from the 
non-Gaussian models studied by Weinberg & Cole (1992). 
They found that the nonlinear power spectrum was very 
similar for a range of initial models, with the exception 
only of those which were strongly skew-negative. It there- 
fore seems likely that mildly non-Gaussian models such as 
cosmic strings should be treated correctly by the method we 
have given. 

We now have the required means of deducing the initial 
conditions that correspond to a given observed non-linear 
mass spectrum. As an example, we show in Figure 3 the 
initial conditions required to create the canonical correlation 
function £(r) = (r/ro) 1 ' 8 - i.e. A 2 (fc) = (fe/fec) 1 ' 8 , where 
k c — 1.058/ro. For low-density models, the initial conditions 
require an enormous 'bite' to be taken out of the spectrum 
for k between k c and several times k c . The reconstructed 
spectrum tends to be very flat and close to A 2 (fc) = 1 over 
a large range of wavenumber. Conversely, for fi = 1 the 
effects of nonlinearities are not very severe in this case until 
we reach A 2 (fc) > 10. These differences will be important 
when we come to linearize the observed data. 
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4 REDSHIFT-SPACE DISTORTIONS 



bib 2 \5 k \ 2 G(y, 0*1,012), 



(30) 



With the exception of surveys where angular data are depro- 
jected to obtain an estimate of the spatial power spectrum, 
three-dimensional clustering data generally involve redshift 
surveys where the radii are distorted by peculiar velocities. 
There are two effects to consider. On large scales, a linear 
analysis should be valid and we have the anisotropic effect 
noted by Kaiser (1987): 



5 k ^bS k (l + fi//b), 



(25) 



where \jl is the cosine of the angle between the wavevector 
and the line of sight. The function /(fi) ~ Q 6 is the well- 
known velocity-suppression factor due to Peebles, which is 
in practice a function of il m only, with negligible dependence 
on the vacuum density (Lahav et al. 1991). The anisotropy 
arises because mass flows from low-density regions onto high 
density sheets, and the apparent density contrast of the pat- 
tern is thus enhanced in redshift space if the sheets lie near 
the plane of the sky. If we average this anisotropic effect by 
integrating over a uniform distribution of /1, the net boost 
to the power spectrum is 



\S k 



b 2 \S k \ 2 (l + l[f/b] + \[f/bf) 



(26) 



On small scales, this is not valid. The main effect here 
is to reduce power through the radial smearing due to virial- 
ized motions and the associated 'finger-of-God' effect. This 
is hard to treat exactly because of the small-scale velocity 
correlations. A simplified model was introduced by Peacock 
(1992) in which the small-scale velocity field is taken to be 
an incoherent Gaussian scatter with ID rms dispersion a. 
This turns out to be quite a reasonable approximation, be- 
cause the observed pairwise velocity dispersion is a very slow 
function of separation, and is all the better if the redshift 
data are afflicted by significant measurement errors (which 
should be included in a). This model is just a radial convo- 
lution, and so the fe-space effect is 



5k — > 5k exp[-fc 2 // 2 f7 2 /2]. 



(27) 



This effect in isolation gives an average isotropic factor of 



15,1 



Fk erf (ka) 
2 k^ 



(28) 



and produces only mild damping (one power of k at large 
k). 

Some workers (e.g. Fisher et al. 1992; Kofman, Gnedin 
& Bahcall 1993) have combined the above two effects simply 
by multiplying the two power correction factors to achieve a 
total distortion. However, this is not correct: both terms are 
anisotropic in k space and they interfere before averaging: 
(A 2 B 2 ) / (A 2 )(B 2 ). For the present paper, it is also inter- 
esting to consider the case of cross- correlation where each 
of two catalogues gives a different measure of the same un- 
derlying density field. The model for the effect in k space of 
cross-correlation is then the product of two separate factors 
of the above form 



\5 k f hb 2 \5 k \ 2 (l + /A* 2 /6i)(l + /a* 2 /6 2 ) x 
exp[— k 2 /j, 2 (a 2 + cr 2 )/2]. 



(29) 



The overall effect is obtained by averaging over fj,, and looks 
more complicated than it really is: 



where 

y 2 = k 2 (a 2 + a 2 2 )/2 
a = f{Sl)/b 
G(y,on,a 2 ) = 
x/i" erf y 



[3aia 2 + 2(ai + a 2 )y 2 + 4y 4 ] 



° XP / [aia 2 (3 + 2y 2 ) + 2(ai + a 2 )y 2 }. 



(31) 
(32) 



(33) 



This simplifies a little in the case of autocorrelations, where 
indices 1 and 2 are equivalent. The interesting aspect of this 
formula is that the linear boost is lost at large k, where the 
result is independent of Q (as is obvious from the anisotropic 
form: the main contribution at large k comes from small fi). 
The true damping at large k is thus more severe than would 
be obtained by multiplying the power corrections prior to 
angular averaging. The simulations of Gramann, Cen & Bah- 
call (1993) show a good level of agreement with the above 
formula in the autocorrelation case. The result is reassur- 
ingly insensitive to the assumed form for the small-scale ve- 
locity distribution function; if we take an exponential instead 
of a Gaussian, we find the same result at small k: 



G(y,cti,a 2 ) 



^ on + a 2 ot\Qt 2 ' 



(5 



Ql + Ct 2 a 



L02 \ 

7~) 



(34) 



and the large-y limit becomes G — > n/(2 3,/2 y) instead of 
G^^ 2 /(2y). 

In practice, the relevant value of a to choose is approxi- 
mately l/\/2 times the pairwise dispersion a\\ seen in galaxy 
redshift surveys. According to the most recent compilation 
of velocity results by Mo, Jing & Borner (1993), this corre- 
sponds to the figure (adopted hereafter) of 



300 kms~ 



(35) 



To this, we should add in quadrature any errors in mea- 
sured velocities. The relatively low value of this dispersion is 
of course a significant problem for some high-density mod- 
els. Gramann, Cen & Bahcall (1993) argue that redshift- 
space power spectra of CDM models fit observation very 
well, mainly because the predicted pairwise dispersion is so 
high in these models. As we shall see below, such an unreal- 
istically large dispersion would spoil the agreement between 
datasets in real and in redshift space. 



5 POWER-SPECTRUM RECONSTRUCTION 
5.1 Data 

We now apply the above tools to some of the more recent 
results on the clustering power spectrum. We shall consider 
eight distinct sets of data, which fall into several distinct 
classes: 

(i) Real-space clustering of galaxies. Baugh & Efstathiou 
(1993) have applied a deprojection procedure to the an- 
gular clustering of the APM galaxy survey to infer the 
nonlinear power spectrum of optically-selected galaxies 
without redshift-space distortions. This paper considers 
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the large-scale power spectrum, and we have thus used 
the APM data at k < 1 /iMpc -1 only. To allow compar- 
ison with other datasets, we have also set a lower limit 
of k > 0.015 hMpc- 1 . 

(ii) Redshift-space clustering of galaxies. We consider three 
datasets: FKP for IRAS galaxies (the QDOT sample); 
Loveday et al. (1993) for the Stromlo/APM survey; Vo- 
geley et al. (1992) for the CfA survey. The last paper 
quotes results for two separate subsets; we have adopted 
a straight mean of the two sets of data. We have not 
used the IRAS data of Fisher et al. (1992), which is 
systematically lower than that of FKP. As discussed by 
FKP, this seems most likely to be a local sampling ef- 
fect. In any case, it is the deeper QDOT sample used 
by FKP which also appears in cross-correlation analyses 
(see [iv] below). 

(iii) Redshift-space clustering of groups and clusters of 
galaxies. We use the power spectrum for R > 1 Abell 
clusters from Peacock & West (1992) and also radio 
galaxies from Peacock & Nicholson (1991), on the as- 
sumption that the strongly enhanced clustering of these 
latter objects may be attributed to their location in 
moderately rich environments. 

(iv) We also use the cross-correlation between IRAS galaxies 
and Abell clusters or radio galaxies from Mo, Peacock 
& Xia (1993). 

Not all of the above data are available directly in power 
spectrum form. In cases where what is published is a cell 
variance or a measure of £(r), we have used the notion of 
an effective wavenumber, as discussed above and in Peacock 
(1991). The treatment of errors requires some discussion. 
Only FKP give a full realistic error covariance matrix for 
their data; the other datasets give errors ranging from Pois- 
son estimates to field-to-field errors, but with no discussion 
of the independence of the measurements at different k. For 
consistency, we have therefore used a fraction of the FKP 
data, spaced widely enough to be roughly independent. Any 
imprecision in this procedure, plus unrecognised systemat- 
ics, will become apparent once the various datasets are com- 
pared with each other. 

The raw power-spectrum data are plotted in Figure 4. 
There is a wide range of power measured, ranging over per- 
haps a factor 20 between the real-space APM galaxies and 
the rich Abell clusters. We now have to see to what extent 
these measurements are all consistent with one Gaussian 
power spectrum for mass fluctuations. 

5.2 Implications for bias and Q 

The reconstruction analysis has available eight datasets con- 
taining 91 distinct k — A 2 pairs. The modelling has available 
five free parameters in the form of fi and the four bias pa- 
rameters for Abell clusters, radio galaxies, optical galaxies 
and IRAS galaxies (&a, 6r, b , bi). We optimized the model 
by making independent determinations of A 2 (fc) for each 
dataset and then comparing them. This was done in prac- 
tice by dividing the range 0.01 < k < 0.1 ftMpc -1 into 20 
bins, evaluating a weighted mean power and a \ 2 f° r each 
bin. The likelihood of the model is given in terms of the 
summed \ 2 values: 

£ oc exp — x 2 /2- (36) 



At this stage, the question arises of whether the errors are 
realistic, which may be judged from whether the overall x 2 
matches the number of degrees of freedom: in fact, it does 
not. A procedure which ensures the required match is to 
add some constant rms error e in quadrature to the existing 
errors. In practice 

e = 23 per cent (37) 

is required for the best fitting model. Such a fudge is unsat- 
isfactory and indicates a failure of understanding of the data 
errors. However, there are grounds for suspecting that some 
of the published errors are too low, so e is not a surprisingly 
large correction. There may be excessive democracy here, in 
that the formally most accurate datasets are penalized most 
strongly by this procedure. On the other hand, these may 
be the ones most likely to 'detect' small residual system- 
atics; it seems conservative to distribute the blame for any 
small disagreement uniformly. One might also query whether 
this correction should be applied at all k; for many models, 
the disagreement is worst at high k. We shall stick with the 
simplest procedure, since the quoted errors are usually much 
larger at low k. 

Of our free parameters, only two are really important: 
Q and a measure of the overall level of fluctuations. We 
take the IRAS bias parameter to play this latter role. Once 
these two are specified, the other bias parameters are well 
determined - principally from the data at small k, where 
we are in the linear regime. The best-fit values depend only 
very slightly on the two controlling parameters, and for all 
allowed models are close to 

b A : b R : b : bj = 4.5 : 1.9 : 1.3 : 1, (38) 

to within 6 per cent rms. We now show likelihood plots for 
the remaining two parameters, Q and bi. Contours of like- 
lihood are displayed in Figure 5, distinguishing the cases 
fit, = (open) and Q m + Q v = 1 (flat). Two main features 
are visible on these plots: the data appear to demand a sig- 
nificant degree of redshift-space distortion, with the optimal 
model having 

fi ' 6 

-— = 1.0 ±0.2 (39) 

Oi 

in both cases (rms error). Models satisfying this constraint 
in which both Q and b are large are allowed, corresponding 
to models well in the linear regime. However, low-bias mod- 
els appear to be less favoured: for low Q, the best models 
have bi ~ 0.8. For the case of flat models, there is a certain 
bimodality, with the preferred values of bi for Q — 0.1 be- 
ing 0.8 and 0.25. However, the heavily antibiased branch of 
solutions can probably be excluded on other grounds, and 
we ignore it hereafter. At the 90 per cent confidence level, 
this analysis requires fi > 0.14. The various reconstructions 
of the linear power spectrum for the case f2 = bi = 1 are 
shown superimposed in Figure 6, and display an impressive 
degree of agreement. This argues very strongly that what 
we measure with galaxy clustering has a direct relation to 
mass fluctuations, rather than the large-scale clustering pat- 
tern being an optical illusion caused by non-uniform galaxy- 
formation efficiency (Bower et al. 1993). If this were the case, 
the shape of spectrum inferred from clusters should have a 
very different shape at large scales, contrary to observation. 
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The detection of redshift-space distortions is based 
largely on the inclusion of the APM survey, since it is the 
only real-space measurement used here. If this dataset is 
removed from the analysis, small values of $7 0,6 /&i are no 
longer excluded. An upper limit at f2°' 6 /6i < 2 can still be 
set; this comes primarily from the cross-correlation data. 
In real space, the cross-correlation should be the geometric 
mean of the two auto-correlation results. Because of the dif- 
ferent effects of the redshift-space mapping, however, this 
is no longer true when redshift-space distortions become 
large. The observed cross-correlations thus set a limit to how 
strong the distortion can be. Some independent confidence 
in the detection of non-zero distortion can be gained from 
the work of Saunders, Rowan-Robinson & Lawrence (1992). 
They deduced the real-space correlation function for IRAS 
galaxies: £(r) = (r/r ) -7 , with r = 3.78±0.14 ft _1 Mpc and 
7 = 1.57±0.03. If we convert this to a power spectrum, it lies 
lower than the QDOT results of FKP by a factor 1.61 ±0.26 
over the range 0.05ft < k < 0.15/i Mpc -1 . This corresponds 
to 0°' 6 /&i = 0.75 ± 0.25, in good agreement with the figure 
deduced above, and provides independent evidence for the 
detection of significant redshift-space distortion. This lower 
degree of real-space clustering is also in agreement with our 
ratio of 1.3 between optical and IRAS bias factors. The 
Saunders et al. figure for ro predicts ro = 5.3ft _1 Mpc for 
optically-selected galaxies in real space, which is very close 
to the canonical value. IRAS galaxies have a slightly smaller 
value of 7, but this only produces an important change in 
relative power on scales rather smaller than those probed 
here. 

The conclusion that models with Q 6 /b ~ 1 and low 
fi are not allowed stems from the effect of nonlinearities: 
the true level of mass fluctuations in such models would be 
very high. Moreover, going to low $7 increases the effect of 
nonlinearities, as discussed above; this trend is less marked 
for the flat models, which is why low densities are not so 
strongly excluded in that case. It is easy to see how this 
conclusion arises by referring to Figure 6. This shows that 
the linear power spectra inferred from galaxy and cluster 
data agree down to k ~ 0.3ft Mpc -1 , where A 2 ~ 1 in the 
best-fitting case. If we assume a higher normalization, the 
effect of nonlinearities in this case is to add power, so the lin- 
ear reconstruction from galaxy data would become very flat 
at high k (cf. Figure 3). However, this would disagree with 
the cluster data, which would still indicate a steep power 
spectrum, since we have assumed that the clusters give the 
linear result directly. This is a general problem with highly 
evolved models: since nonlinearities change the shape of the 
power spectrum at A 2 ~ 1, and especially so for low densi- 
ties, it requires something of a conspiracy for the nonlinear 
power spectrum to be a featureless power law (see Gott & 
Rees 1975). However, on the present assumptions, extreme 
nonlinear evolution should steepen the galaxy correlations 
faster than those for clusters, and yet they empirically have 
much the same slope. The easiest way of understanding this 
is to say that the degree of nonlinearity is only mild. This is 
certainly an issue which merits further investigation, and a 
detailed simulation of cluster formation in a highly nonlinear 
low-density model would be most valuable. In the meantime, 
it is interesting to note that the constraints we have drawn 
here on density and bias are very similar to those obtained 
in a completely independent way by the POTENT group 



in their analysis of the peculiar-velocity field (Dekel et al. 
1993). 

Table 1 gives the final data for the mean reconstructed 
power spectrum, for the case Q, = bj = 1. The data have been 
averaged in bins of width 0.1 in log 10 (wavenumber) and the 
errors quoted are standard errors. These numbers are plotted 
in Figure 7, and will be compared with models in the next 
Section; as will be shown there, the data are consistent with 
a smooth and featureless power spectrum, despite the small 
size of the errors. One of the pleasant features of our result 
is that the power spectrum is only weakly dependent on 
model parameters. For = 1, the power is not so sensitive 
to b, because in redshift space (the majority of the data) we 
measure 



A 2 «b 2 (l±|[//b]±l[//b] 2 ) 



(40) 



The overall power correction factor thus scales only as b w ^ 7 
for b close to unity. This can be used to rescale our 'standard' 
result to some other desired value of b, given $7 = 1. For low 
densities, an empirical formula for the scaling of the linear 
mass spectrum in the present analysis is 



A, oc il 



(41) 



It is convenient to be able to compare the results here 
with another common measure of the amplitude of linear 
mass fluctuations. This is as: the linear-theory rms density 
contrast when averaged over spheres of radius 8ft -1 Mpc: 



a *= I A (fc) T jkW 



[sin kR — kR cos kR] 2 



(42) 



The squared window function weighting the power spectrum 
is very close to a Gaussian W 2 = exp[— fc 2 i? 2 /5], and so a\ 
is just A 2 (fc) at some effective wavenumber: 



a\ = A 2 (k R ) 



kn = 



Qn + l]/2) 



-i l/(n+3) 



R ' 



(43) 
(44) 



where n is the effective power-law index of the power spec- 
trum. As before, this approximation is within a few per cent 
of the exact integration provided n < 0. On the scales of in- 
terest, the effective index is close to —1.5 and so the effective 
wavenumber for as is k = 0.20. Using the above scalings, we 
get 



(j 8 = 0.75 O" 



(45) 



with a formal rms uncertainty of 13 per cent. 

The significance of 8ft -1 Mpc as a normalization scale is 
that as is of order unity and thus its value can be probed by 
observations of weakly nonlinear structures such as galaxy 
clusters. White, Efstathiou & Frenk (1993) discuss this con- 
straint, and deduce as = 0.57 $7~ 0,56 for spatially flat models 
(although the scaling should be very similar for open mod- 
els), to within a tolerance of roughly ±10 per cent. The 
precise meaning of their uncertainty is hard to quantify, but 
it seems intended to give hard limits, rather than an rms. 
The agreement with our results is very good; the $7 depen- 
dence is steeper, but the disagreement in as is only a factor 
1.4 even for Q. = 0.2. 
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6 POWER-SPECTRUM DATA AND MODELS 
6.1 CDM-like models 

It is interesting to ask if the power spectrum contains any 
features, or whether it is consistent with a single smooth 
curve. In fact, a variety of simple models describe the data 
from Table 1 very well within the errors. Consider the fit- 
ting formula used by Peacock (1991), which is just a break 
between two power laws: 

A 2 ^ ( fc / fc o)" 

{k) = i + ow-g - w 

This works well, with 

fc = 0.29 ± 0.01 ft Mpc -1 (47) 

fci = 0.039 ± 0.002 ft Mpc" 1 (48) 

a = 1.50 ± 0.03 (49) 

(3 = 4.0 ±0.5. (50) 

A value of (3 = 4 corresponds to a scale-invariant spectrum 
at large wavelengths. 

An more physical alternative is the CDM power spec- 
trum, which is A 2 (fc) oc k n+:i T 2 . We shall use the BBKS 
approximation for the transfer function: 

_ ln(l + 2.34g) 
lk ~ 2.Mq X 

[1 + 3.89g + (14.1q) 2 + (5.46g) 3 + (6.7b?) 4 ] -1/4 , (51) 

where q = fc/[Oft 2 Mpc -1 ]. Since observable wavenumbers 
are in units of ft Mpc -1 , the shape parameter is the appar- 
ent value of Oft. This scaling applies for models with zero 
baryon content, but there is an empirical scaling that can 
account for the effect of baryons, and which deserves to be 
more widely known. Figure 8 shows a compilation of CDM 
transfer functions taken from Holtzman (1989). When plot- 
ted against fc/Oft 2 , there is a strong dependence on baryon 
density: high baryon content mimics low CDM density. If we 
instead use the scaling 



Tfc(fe) = T BB Ks(£:/[Oft 2 exp-20 B ]), 



(52) 



then all the curves lie on top of one another to a few per 
cent tolerance. We shall henceforth use the term 'Oft' to 
refer to the BBKS fitting parameter, on the understanding 
that it means the combination Oftexp — 20 B . Our results 
will hence differ slightly from those of Efstathiou, Bond & 
White (1992), who defined a parameter T which is almost 
Oft. Unfortunately, they scaled to a 'standard' CDM model 
with B = 0.03, with the result that T = 1.060ft. 

Fitting the CDM model to our data also results in a 
satisfactory \ 2 an d requires the parameters 



Oft = 0.255 ± 0.017 + 0.32(l/n - 1), 



(53) 



in agreement with many previous arguments suggesting that 
a low-density model is needed. The fit of this and other mod- 
els is illustrated in Figure 9. For any reasonable values of ft 
and baryon density, a high-density CDM model is not viable. 
Even a high degree of 'tilt' in the primordial spectrum (Cen 
et al. 1992) does not help reach the required Oft ~ 0.75. 
The alternatives are either to retain the CDM model, but 
assume that some piece of unknown physics has produced 
a transfer function which looks like a low-density model, or 
to adopt a low density, or to go for something else entirely. 



As far as low densities are concerned, note that the popular 
choice of O = 0.2 (e.g. Kauffmann & White 1992) will over- 
shoot and yield too low values of Oft. More viable alterna- 
tives with high density are either mixed dark matter (Holtz- 
man 1989; van Dalen & Schaefer 1992; Taylor & Rowan- 
Robinson 1992; Davis, Summers & Schlegel 1992; Klypin et 
al. 1993; Pogosyan & Starobinsky 1993), or non-Gaussian 
pictures such as cosmic strings + HDM, where the lack of a 
detailed prediction for the power spectrum helps ensure that 
the model is not yet excluded (Albrecht & Stebbins 1992). 
Mixed dark matter seems rather ad hoc, but may be less so 
if it is possible to produce both hot and cold components 
from a single particle, with a Bose condensate playing the 
role of the cold component (Madsen 1992; Kaiser, Malaney 
& Starkman 1993). However, the shape of the MDM spec- 
trum is not really all that close to the spectrum deduced 
here: it bends much more sharply, and is very flat on small 
scales. At the quasilinear scale k — 0.2ft Mpc -1 , the local 
power-law index for the MDM model is about n = —2.2, as 
opposed to our empirical value n ~ —1.5. If the good fit of 
a low-density CDM transfer function is taken literally, then 
perhaps this is a hint that the epoch of matter-radiation 
equality needs to be delayed. An approximate doubling of 
the number of relativistic degrees of freedom would suffice 
- but this would do undesirable violence to primordial nu- 
cleosynthesis: any such boost would have to be provided by 
a particle which decays after nucleosynthesis. The apparent 
value of Oft depends on the mass and lifetime of the particle 
roughly as 



Oft| 



apparent 



Oft [1 + (m keV T ycars ) ] 



2/3-1-1/2 



(54) 



(Bardeen, Bond & Efstathiou 1987; Bond & Efstathiou 
1991), so a range of masses is possible. Apart from mak- 
ing the observed large-scale structure, such a model yields a 
small-scale enhancement of power which could lead to early 
galaxy formation. Whether the required particle physics is 
at all plausible remains to be seen, but the model is arguably 
the most attractive of those currently available. 

An important general lesson to be drawn from this sec- 
tion is the lack of large-amplitude features in the power 
spectrum. This is a strong indication that collisionless mat- 
ter is deeply implicated in forming large-scale structure. 
Purely baryonic models contain large bumps in the power 
spectrum around the Jeans' length prior to recombination 
(k ~ 0.03Oft 2 Mpc -1 ), whether the initial conditions are 
isocurvature or adiabatic (e.g. Section 25 of Peebles 1993). 
It is hard to see how such features can be reconciled with 
the data. 



6.2 Peculiar velocities 

The mass power spectrum has a direct application in pre- 
dicting the cosmological peculiar velocity field. The 3D rms 
velocity for clumps averaged over some window is 



a 2 = H 2 /(0) 2 / A\k) 



dk 
fc3 



(55) 



so we can use the power spectrum to make a direct prediction 
of this quantity, which is shown in Figure 10 for the case of 
spheres of varying radii. The velocity power spectrum (oc 
fc -2 A 2 (fc)) peaks around the break in the power spectrum 
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at k ~ 0.03/i Mpc -1 , and so the predicted velocities decline 
rapidly for spheres which filter out this scale. 

For Q — 1, the predicted velocities are very reason- 
able. If we model the local group as a sphere of radius 5ft _1 
Mpc, the 3D rms is 680 kms -1 , as against the observed 
600 one-point local measurement (the answer is very insen- 
sitive to the size used to define the local group). Figure 10 
also shows the deduced velocities from the POTENT group 
(Bertschinger et al. 1990) for spheres of radius 40 and 60 
/i _1 Mpc, which also agree well. However, the predictions are 
completely inconsistent with the velocity of 842 kms -1 for 
the local sphere out to 150/i _1 Mpc claimed by Lauer & Post- 
man (1993). The predicted 3D rms for this scale is only 140 
kms -1 . Even if we allow that their weighting scheme might 
reduce the effective radius of their sphere (they weight each 
radial shell equally), there remains a qualitative discrepancy. 
If this results were to be confirmed, it would probably indi- 
cate a large feature in the power spectrum on scales beyond 
those probed here (k < O.Ol/i Mpc -1 ). 

The empirical power spectrum deduced here thus seems 
to agree extremely well with large-scale velocity data. The 
crucial test for Q — 1 models, however, has often been the 
small-scale velocity dispersion. The preferred low-fi/i model 
predicts a pairwise dispersion at lh^ 1 Mpc separation of 
about cry = 550 kms -1 (Mann 1993), which is interestingly 
close to more recent observational data (Mo, Jing & Borner 
1993). 

6.3 CMB anisotropies 

We now relate the measurement of mass fluctuations on 
scales of several hundred Mpc to those implied on larger 
scales from the measurement of CMB fluctuations by the 
COBE team (Smoot et al. 1992). This is a subject which 
has advanced rapidly since the original detection, with a 
more widespread appreciation of the possible contribution 
of gravitational waves to the anisotropy (following the origi- 
nal insight of Starobinsky 1985). We therefore distinguish 
explicitly between scalar and tensor contributions to the 
CMB fluctuations by using appropriate subscripts. The for- 
mer category are those described by the Sachs- Wolfe effect, 
and are gravitational potential fluctuations that relate di- 
rectly to mass fluctuations. For a Gaussian beam of FWHM 
2.35<7, the correlation function of the microwave sky is 



C s (6) = ±- 5^(2* + 1) WlCePe (cos 9), 



(56) 



where Pi are Legendre polynomials, and We = 
exp(— £ a /2). The coefficients Ce are 

Ct = W7T Ifffi) J (M2 C /#o]r 4 A 2 (fc) jl(kR H ) y (57) 

where ji are spherical Bessel functions (see Peebles 1982). 
The length 7? H is the present comoving horizon size 

2c 



Rn — 



QH 

2c 



(Open) 
(Flat) 



(58) 
(59) 



(Vittorio & Silk 1991) and the function g(Q) is the linear 
growth suppression factor relative to = 1, as discussed 
earlier. These formulae strictly apply only to spatially flat 



models, since the notion of a scale-free spectrum is imprecise 
in an open model. Nevertheless, since the curvature radius 
subtends an angle of Q/[2(l — f2) 1//2 ], normalizing to COBE 
in an open model should not be a very bad approximation 
until we reach Q < 0.2. We shall therefore ignore this uncer- 
tainty in what follows. 

In the case of the COBE measurements, the simplest 
and most robust datum is just the sky variance convolved 
to 10° FWHM, i.e. C s (0) in the above expression with a = 
4.25°. This can be converted into an integral over the power 
spectrum multiplied by a window function which is a sum 
over Bessel functions. In practice, it is convenient to have 
a simpler expression for the window, and it turns out that 
this can be achieved to almost perfect accuracy by using a 
small-angle approximation: 

l2 



C s (0) 



W\y) 



9/ 



g 2 m 



4(fc[2c/ J ffo]r 4 A"(fc) W\kR K ] 



dk 
k 



■3S(y)-VUv)]F(w)/{w), 



(60) 
(61) 



where F(x) is Dawson's integral. The terms involving Bessel 
functions correspond to the subtraction of monopole and 
dipole terms. The window function is relatively sharply 
peaked and so the COBE variance essentially picks out the 
power at a given scale. For the case of a = 0.0742 (FWHM 
of 10°), the result is very well fit by 



Cs(0) = 1.665 



Q 2 



[A(ks[2c/H ])- 4 A 2 (k s )] 



(62) 
(63) 



<? 2 (") 

k s R H = 7.29 + 2.19(n-l) 

The observed value is C 1/2 (0) = 1.10 ± 0.18 x 10~ 5 (Smoot 
et al. 1992). For scale-invariant spectra, this corresponds to 
an rms quadropole of Q rm s = 15.0 ± 2.5 /xK. For Q = 1, 
this translates to a normalization of e = 2.6 ± 0.4 x 10~ 5 in 
the notation of Peacock (1991). How well does this ampli- 
tude match on to the clustering observed at 100 Mpc wave- 
lengths? If we stick to asymptotically scale-invariant spectra, 
the agreement is very good. The CDM fit shown in Figure 
9 requires 



3.25 ±0.18 x 10" 



(64) 



This is slightly higher than the COBE measurement, but 
well within experimental error. If the large-scale normaliza- 
tion is forced to be e = 2.6 x 10~ 5 , the best-fitting CDM 
shape changes to Q.h — 0.31. 

For a more general comparison, it is convenient to define 
a reference datum at the largest scale where our data are 
still accurate. From Table 1, we take this to be A 2 (k — 
0.028fe) = 0.0087 ± 0.0023. At this point, there is still some 
curvature in the power spectrum: the flh — 0.25 transfer 
function is Tk = 0.61 and the effective transfer function 
defined by the two power-law formula is Tk = 0.80. We shall 
adopt a compromise T fe =0.70 and hence deduce 



A 2 {k = 0.028/i) = 0.018 ± 0.0023 



(65) 



as our best estimate of the true level of any primordial 
power-law fluctuations on these scales (subject to scalings 
as above if Q 1). We can now use our earlier discussion 
of the COBE data to predict this small-scale fluctuation, ig- 
noring for the moment any gravity-wave contribution. The 
answer is 



A 2 (/c = 0.028/i) = 0.014 exp[3.2(n - 1)] fi~ (K7 



(open) (66) 
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= 0.014 exp[3.2(n - 1)] fi -1 ' 6 (flat). (67) 

Thus, if we adopt Q = 1, there is a very good agreement with 
scale-invariance: n = 1.08 ± 0.04. Conversely, tilted models 
do not match large and small scales very well: for n — 0.7, 
the predicted power near 100 Mpc is too small by a factor 3. 
Things get worse if gravity waves are included: a prediction 
of many inflationary models is that 

g£~6(l-n) (68) 

(e.g. Liddle & Lyth 1991; Lidsey & Coles 1991; Lucchin, 
Matarrese & Mollerach 1992; Souradeep & Sahni 1992), 
which decreases the predicted small-scale power by a fur- 
ther factor 2.8 for n — 0.7, making a total mismatch of a 
factor 8. It is inconceivable that our analysis of the 100- 
Mpc scale power could be in error by this amount. Thus, 
although tilted models may be attractive in removing the 1- 
degree 'bump' in the predicted microwave sky (Crittenden 
et al. 1993) and allowing consistency with intermediate-scale 
CMB experiments, it seems implausible that this can be the 
correct solution, at least if f2 = 1. To allow a tilted model 
with n — 0.7, we need f2 ~ 0.06 or 0.3 respectively in the 
open and flat cases. 



7 SUMMARY 

We have analysed a compilation of recent measures of galaxy 
clustering, under the assumption of underlying Gaussian 
mass fluctuations. We have presented new methods for deal- 
ing analytically with the modifying effects of nonlinear evo- 
lution and redshift-space distortions, and their effect on the 
power spectrum. Applying these methods to the data leads 
to a consistent determination of the linear mass spectrum, 
with the following properties. 

(i) The relative bias factors for Abell clusters, radio galax- 
ies, optical galaxies and IRAS galaxies must be in the 
ratios b A : &r : b : bi = 4.5 : 1.9 : 1.3 : 1, to within 6 
per cent rms. 

(ii) The data require a significant degree of redshift-space 
distortion: Q. os /&, = 1.0 ± 0.2. 

(iii) Low values of £7 and bias are disfavoured because non- 
linear evolution would spoil the agreement in shape be- 
tween galaxy and cluster power spectra. Both this and 
the previous conclusion are in good agreement with in- 
dependent studies based on peculiar velocity fields. 

(iv) The linear power spectrum is smooth and featureless, 
and is well described by a zero-baryon CDM model with 
Qh = 0.25. 

(v) The amplitude of 100-Mpc power matches well onto 
that inferred from COBE provided the primordial spec- 
trum was close to scale- invariant. Tilted models that 
postulate a dominant gravity-wave CMB component are 
difficult to reconcile with our data. 
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FIGURE CAPTIONS 

Figure 1 The nonlinear evolution of power spectra ac- 
cording to the HKLM method, and the approximate direct 
alternative presented here. The dashed lines show the input 
linear power spectra, the solid lines show the result of nu- 
merically integrating the HKLM method and the one-step 
solution is shown dotted. Panel (a) shows a variety of COBE- 
normalized CDM models (with Clh = 0.2, 0.3, 0.4, 0.5; while 
(b) shows fift=0.5 CDM filtered with different Gaussian win- 
dows (Rf = 0.25, 0.5, 0.75, 1 ft -1 Mpc), approximating the 
effect of warm dark matter. As expected, our method fails 
at very high k, where the linear CDM spectra become very 
flat and the linear WDM spectra cut off, but is otherwise 
excellent. Note that the effect of the WDM cutoff is only felt 
at very large k: WDM is not the explanation for the shape 
of the power spectrum around k = 0.1ft. 
Figure 2 The generalization of the HKLM function relat- 
ing nonlinear power to linear power (or £) , as in the original 
method. The lowest curve is the original HKLM function 
for CI = 1; low densities give a greater nonlinear response. 
We show the fitting formula for open models only, but what 
matters in general is just the ^-dependent linear growth 
suppression factor. 

Figure 3 The inverse of the generalized HKLM procedure, 
as applied to a power-law power spectrum A 2 (fc) = (fc/fc c ) 1,8 
(in correlation-function terms, ro = 0.945/fc c ). Open models 
with CI = 1, 0.5, 0.3, 0.2, 0.1 are considered. Note that the 
effects of nonlinearities in this case are rather small for high 
densities, but for low densities the required linear initial con- 
ditions are very flat for k > k c . This flat case is one where our 
approximate inversion of the HKLM procedure will not be 
perfect (cf. Figure 1); nevertheless, any errors will be small 
in comparison with the systematic feature in the power spec- 
trum required around A 2 ~ 1. It clearly requires something 
of a conspiracy to achieve a scale- free nonlinear spectrum in 
a low-density model. 

Figure 4 The raw power-spectrum data used in this anal- 
ysis. All data with the exception of the APM power spec- 
trum are in redshift space. The two lines shown for refer- 
ence are the transforms of the canonical real-space correla- 
tion functions for optical and IRAS galaxies (ro = 5 and 
3.78 ft -1 Mpc and slopes of 1.8 and 1.57 respectively). 
Figure 5 Contours of relative likelihood based on the de- 
gree of agreement of the various estimates of linear power 
spectra. At each (Ct,bi) point, the other bias factors have 
been optimized. We distinguish the cases Cl v = (open) 
and Clm + Cl v — 1 (flat). Contours are plotted at what would 
be the 50, 90, 95, 99, 99.5 per cent confidence levels in a 
two-dimensional Gaussian (i.e. Aln£ = 0.69, 2.3, 3.0, 4.6, 
5.3). 

Figure 6 The power-spectrum data from Figure 4, indi- 
vidually linearized assuming CI = bi = 1. There is an excel- 
lent degree of agreement, particularly in the detection of a 
break around k = 0.03ft. 

Figure 7 The linearized data of Figure 6, averaged over 
bins of width 0.1 in log 10 k. This plot assumes CI = bi = 1; 
for lower densities the power increases slightly, as described 
in the text. 

Figure 8 A set of CDM transfer functions, using the fit- 
ting formulae of BBKS for zero baryon content, and those of 
Holtzman (1989) for Cl B = 0.01, 0.03, 0.05, 0.1. Models with 
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h = 1 are shown as solid lines, h — 0.5 are dotted. When 
plotted (a) against k/Qh 2 , the varying baryon content causes 
variations in the curves, with higher f2 B producing greater 
suppression of power. However, this can be scaled away 
(b) by plotting against the combination k/[Q,h 2 exp — 2fi B ]- 
In these terms, the CDM transfer function has a univer- 
sal shape, which can be described by the zero-baryon for- 
mula of BBKS. The scaling becomes noticeably imperfect 
for Q B > 0.3, but is very good for the models plotted here. 
Figure 9 The averaged linear power-spectrum data of 
Figure 7, compared to various CDM models. These as- 
sume scale-invariant initial conditions, with the same large- 
wavelength normalization. Different values of the fitting pa- 
rameter Q,h = 0.5, 0.45, . . . 0.25, 0.2 are shown. The best-fit 
model has Qh = 0.25 and a normalization which is 1.5a 
higher than COBE if Q = 1 and gravity-wave anisotropics 
are negligible (e = 3.25 x 10" 5 ). 

Figure 10 The predicted 3D rms velocity of spheres as a 
function of radius, assuming Q — 1. This is based on the 
two power-law fitting formula for the power spectrum. For 
low densities the velocities are reduced, but by less than 
the normal fi 0,6 factor, because the inferred mass fluctua- 
tions rise in that case (see text). The plotted points are the 
local group motion assigned to a radius of 5/i _1 Mpc, and 
motion of larger spheres taken from the POTENT group 
(Bertschinger et al. 1990). Although the local-group motion 
is very well determined, it is assigned a fractional error of 
1/V6 to allow for fluctuations in the 3D rms velocity seen 
by different observers. 
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